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Using particle system methodologies we study the propagation of 
financial distress in a network of firms facing credit risk. We investi- 
gate the phenomenon of a credit crisis and quantify the losses that 
a bank may suffer in a large credit portfolio. Applying a large devi- 
ation principle we compute the limiting distributions of the system 
and determine the time evolution of the credit quality indicators of 
the firms, deriving moreover the dynamics of a global financial health 
indicator. We finally describe a suitable version of the "Central Limit 
Theorem" useful to study large portfolio losses. Simulation results are 
provided as well as applications to portfolio loss distribution analysis. 

1. Introduction. 

1.1. General aspects. The main purpose of this paper is to describe prop- 
agation of financial distress in a network of firms linked by business rela- 
tionships. Once the model for financial contagion has been described, we 
quantify the impact of contagion on the losses suffered by a financial insti- 
tution holding a large portfolio with positions issued by the firms. 

A firm experiencing financial distress may affect the credit quality of 
business partners (via direct contagion) as well as of firms in the same sector 
(due to an information effect). 

We refer to direct contagion when the actors on the market are linked by 
some direct partner relationship (e.g., firms in a borrowing-lending network). 
Reduced-form models for direct contagion can be found — among others — in 
Jarrow and Yu [27] for counterparty risk, Davis and Lo [13] for infectious 
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default, Kiyotaki and Moore [28], where a model of credit chain obligations 
leading to default cascade is considered and Giesecke and Weber [23] for a 
particle system approach. Concerning the banking sector, a microeconomic 
liquidity equilibrium is analyzed by Allen and Gale [1] . 

Information effects are considered in information- driven default models; 
here the idea is that the probability of default of each obligor is influenced 
by a "not perfectly" observable macroeconomic variable, sometimes also 
referred to as frailty. This dependence increases the correlation between the 
default events. For further discussions on this point see Schdnbucher [33] as 
well as DufEe et al. [16] and Collin-Dufresne et al. [7]. 

1.2. Purpose and modeling aspects. We propose in this paper a direct 
contagion model which is constructed in a general modeling framework 
where information effects could also be included. In addition to modeling 
contagion, with the approach that we shall develop we intend also to find 
a way to explain what is usually referred to as the clustering of defaults 
(or credit crises), meaning that there is evidence — looking at real data — of 
periods in which many firms end up in financial distress in a short time. 
A standard methodology to reproduce this real-world effect is to rely on 
macroeconomic factors as indicators of business cycles. These factor models 
seem to explain a large part of the variability of the default rates. What 
these models do not explain is above all clustering: as Jarrow and Yu in 
[27] argue, ll A default intensity that depends linearly on a set of smoothly 
varying macroeconomic variables is unlikely to account for the clustering of 
defaults around an economic recession.''' 

A second issue that we would like to capture is — in some sense — more 
"fundamental" and refers to the nature of a credit crisis. We shall propose a 
model where the general "health" of the system is described by endogenous 
financial indicators, endogenous in the sense that its dynamics depends on 
the evolution of the variables of the system. Our aim is to show how a credit 
crisis can be described as a "microeconomic" phenomenon, driven by the 
propagation of the financial distress through the obligors. 

Our model is to be considered within the class of reduced-form models 
and is based on interacting intensities. The probability of having a default 
somewhere in the network depends also on the state of the other obligors. 
The first papers on interacting intensities appear to be those by Jarrow and 
Yu [27], and Davis and Lo [13] on infectious default. 

In our perspective the idea of a network where agents interact leads natu- 
rally to the literature of particle systems used in statistical mechanics. This 
point of view is quite new in the world of financial mathematics especially 
when dealing with credit risk management. Among some very recent papers 
we would like to mention the works by Giesecke and Weber [23], and [24] 
for an interacting particle approach, the papers by Frey and Backhaus [19] 
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on credit derivatives pricing and Horst [26] on cascade processes. More de- 
veloped is the use of particle and dynamical systems in the literature on 
financial market modeling. It has been shown that some of these models 
have "thermodynamic limits" that exhibit similar features compared to the 
limiting distributions (in particular when looking at the tails) of market 
returns time series. For a discussion on financial market modeling see the 
survey by Cont [9] and the paper by Follmer [18] that contains an inspiring 
discussion on interacting agents. 

Another reason to focus on particle systems is that they allow to study 
a credit crisis as a microeconomic phenomenon and so provide the means 
to explain phenomena such as default clustering that are difficult to explain 
by other means. In fact, interacting particle systems may exhibit what is 
called phase transition in the sense that in the limit, when the number N of 
particles goes to infinity, the dynamics may have multiple stable equilibria. 
The effects of phase transition for the system with finite N can be seen 
on different time-scales. On a long time-scale we expect to observe what 
is usually meant by metastability in statistical mechanics: the system may 
spend a very long time in a small region of the state space around a stable 
equilibrium of the limiting dynamics and then switch relatively quickly to 
another region around a different stable equilibrium. This switch, of which 
the rigorous analysis will be postponed to future work, occurs on a time-scale 
proportional to e kN for a suitable k > 0, that could be unrealistic for financial 
applications. The model we propose exhibits, however, a different feature 
that can be interpreted as a credit crisis. For certain values of the initial 
condition the system is driven toward a symmetric equilibrium, in which half 
of the firms are in good financial health. After a certain time that depends 
on the initial state, the system is "captured" by an unstable direction of this 
symmetric equilibrium, and moves toward a stable asymmetric equilibrium; 
during the transition to the asymmetric equilibrium, the volatility of the 
system increases sharply, before decaying to a stationary value. All this 
occurs at a time-scale of order O(l) (i.e., the time-scale does not depend 
on N). 

1.3. Financial application. As already mentioned in Section 1.1, the ap- 
plied financial aim of this paper is to quantify the impact of contagion on 
the losses suffered by a financial institution holding a large portfolio with 
positions issued by the firms. In particular, we aim at obtaining a dynamic 
description of a risky portfolio in the context of our contagion model. The 
standard literature on risk management usually focuses on static models 
allowing to compute the distribution of a risky portfolio over a given fixed 
time-horizon T. For a recent paper that introduces a discussion relating to 
static and dynamic models see Dembo, Deuschel and Duffie [14]. 
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We shall consider large homogeneous portfolios. Attention to large ho- 
mogeneous portfolios becomes crucial when looking at portfolios with many 
small entries. Suppose a bank is holding a credit portfolio with iV = 10,000 
open positions with small firms; it is quite costly to simulate the dynamics 
of each single firm, taking into account all business ties. If the firms are 
supposed to be exchangeable, in the sense that the losses that they may 
cause to the bank in case of financial distress depend on the single firm only 
via its financial state indicator, it is worth evaluating a homogeneous model 
where N goes to infinity and then to look for "large-iV" approximations. 
This apparently restrictive assumption may be easily relaxed by considering 
many homogeneous groups within the network (in this context see also [19]). 

We shall provide formulas to compute quantiles of the probability of excess 
losses in the context of our contagion model; we shall in fact determine the 
entire portfolio loss distribution. Other credit risk related quantities can also 
be computed, as we shall briefly mention at the end of Section 4. 

We conclude this section by noticing that in recent years the challenging 
issue of describing the time evolution of the loss process connected with port- 
folios of many obligors has received more and more attention. Applications 
can be found, for example, in the literature dealing with pricing and hedging 
of risky derivatives such as CDOs, namely Collateralized Debt Obligations 
(see, e.g., the papers by Prey and Backhaus [20], Giesecke and Goldberg [22] 
and Schonbucher [34]). 

We believe that our paper may be considered as an original contribution 
to the modeling of portfolio loss dynamics: to our knowledge, this is the first 
attempt to apply large deviations on path spaces (i.e., in a dynamic fashion) 
for finance or credit management purposes. For a survey on existing large 
deviations methods applied to finance and credit risk see Pham [31]. 

1.4. Methodology. Our interacting particle system, which describes the 
firms in the network, will be Markovian, but nonreversible. Usually, when 
the dynamics admit a reversible distribution, this distribution can be found 
explicitly by the detailed balance condition [see (6) below]. In the model we 
propose in this paper, and that will be introduced in Section 2, no reversible 
distribution exists. This makes it difficult to find an explicit formula for the 
stationary distribution. For this reason we have not pursued the "static" 
approach consisting in studying the N — ► +oo asymptotics of the stationary 
distribution. We shall rather proceed in a way that in addition allows to 
obtain nonequilibrium properties of the system dynamics. First we study the 
./V — ► oo limiting distributions on the path space. To this effect we shall derive 
an appropriate law of large numbers based on a large deviations principle. 
We then study the possible equilibria of the limiting dynamics. This study 
leads to considering different domains of attraction corresponding to each 
of the stable equilibria. Finally, we study the finite volume approximations 
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(for finite but large N) of the limiting distribution via a suitable version of 
the Central Limit Theorem that allows to analyze the fluctuations around 
this limit. As a consequence of the different domains of attraction of the 
limiting dynamics one obtains for finite N and on ordinary time-scales an 
interesting behavior of the system that has an equally interesting financial 
interpretation, which was already alluded to at the end of Section 1.2. This 
behavior will also be documented by simulation results. 

Our interaction model is characterized by two parameters indicating the 
strength of the interactions. Phase transition occurs in an open subset of the 
parameter space, whose boundary is a smooth curve (critical curve) that we 
determine explicitly. We shall derive the Central Limit Theorem in a fixed 
time-interval [0, T] for every value of the parameters. We do not consider in 
this paper the Central Limit Theorem in the case when the time-horizon T 
depends on ./V itself; it will be dealt with elsewhere. When T grows with ./V 
we expect the behavior to depend more strongly on the parameters. In the 
case when the parameters belong to the uniqueness region (the complement 
of the closure of the region where phase transition occurs) we believe that 
the Central Limit Theorem should be uniform in time, while in the phase 
transition region the Central Limit Theorem should extend to any time-scale 
strictly smaller than the metastability scale (which grows exponentially in 
N). On the critical curve one expects a critical time-scale (of order yN) at 
which large and non-Gaussian fluctuations are observed. 

For real applications the interaction parameters have to be calibrated 
to market data. In this paper we do not consider the issue of calibration 
but rather present some simulation results of the loss behavior for different 
values of the parameters. 

The outline of the paper is as follows. The more detailed description of 
the model will be given in Section 2. Section 3 is devoted to stating the 
main limit theorems on the stochastic dynamics, in particular a law of large 
numbers and a central limit theorem. The financial application, in particular 
to large portfolio losses with specific examples, will be described in Section 
4. Section 5 contains the proofs of the results stated in Sections 3 and 4. A 
Conclusions section completes the paper. 

2. The model. 

2.1. A mean-field model. In this section we describe a mean-field interac- 
tion model. What characterizes a mean-field model — within the large class 
of particle systems — is the absence of a "geometry" in the configuration 
space, meaning that each particle interacts with all the others in the same 
way. This "homogeneity" assumption is clearly rather restrictive; neverthe- 
less this kind of framework has been proposed by authors in different fields. 
Among the others we quote Prey and Backhaus [19] for a credit risk model 
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and Brock and Durlauf [4] for their contribution to the Social Interaction 
models. These models are used to capture the interaction of agents when 
facing any kind of decision problems. As pointed out in [19], if we are con- 
sidering a large group of firms belonging to the same sector (e.g., the energy 
sector), then the ability of generating cash flows and the capacity of rais- 
ing capital from financial institutions may be considered as "homogeneous" 
characteristics within the group (and this assumption is quite common in 
practice) ; we moreover recall that the final aim of this work is to study ag- 
gregate quantities for a large economy such as the expected global health 
of the system and large portfolio losses as well as related quantities. These 
considerations allow us to avoid the (costly) operation of modeling a fully 
heterogeneous set of firms. 

Other approaches, different from the mean-field one, have also been pro- 
posed in the literature: Giesecke and Weber have chosen a local-interaction 
model (the Voter model 1 ) assuming that each particle interacts with a fixed 
number d of neighbors; it may be argued that the hypothesis that each firm 
has the same (constant) number of partners is rather unrealistic. Cont and 
Bouchaud (see [10]) suggest a random graph approach, meaning that the 
connections are randomly generated with some distribution functions. 

The philosophy behind our model can be summarized as follows: 

• We introduce only a small number of variables that, however, have a 
simple economic interpretation. 

• We define dynamic rules that describe interaction between the variables. 

• We keep the model as simple as possible; in particular, as we shall see, 
we define it in such a way that it has some symmetry properties. On one 
hand this may make the model less adherent to reality; on the other it 
leads to exact computations and still allows to show what basic features 
of the model produce phenomena such as clustering of defaults, phase 
transition, etc. More generally, it allows to show how, contrary to most 
models relying on macroeconomic factors, the "health" of the system can 
here be described by endogenous financial indicators so that a credit crisis 
can be viewed as a microeconomic phenomenon. 

Consider a network of N firms. The state of each firm is identified by two 
variables, that will be denoted by a and uo [(cj,u;j) is the state of the ith 
firm]. The variable a may be interpreted as the rating class indicator: a low 
value reflects a bad rating class, that is, a higher probability of not being 
able to pay back obligations. The variable co represents a more fundamental 
indicator of the financial health of the firm and is typically not directly 



1 The Voter model assumes — roughly speaking — that the variable <n 6 { — 1,1} is more 
likely to take a positive value if the majority of the nearest neighbors of i are in a positive 
state and vice versa. 
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observable. It could, for example, be a liquidity indicator as in Giesecke 
and Weber [23] or the sign of the cash balances as in Cetin et al. [5]. The 
important fact is that, while there is usually a strong interaction between 
(jj and Ui , the nonobservability of u makes it reasonable to assume that u>i 
cannot directly influence the rating indicators aj for j/i. 

In this paper we assume that the two indicators crj , Ui can only take two 
values, that we label by 1 ("good" financial state) and —1 (financial distress). 
In the case of portfolios consisting of defaultable bonds, we may then refer to 
the rating class corresponding to a = — 1 also as "speculative grade" and that 
corresponding to a = +1 as "investment grade." Although the restriction to 
only two possible values may appear to be unrealistic, we believe that many 
aspects of the qualitative behavior of the system do not really depend on 
this choice. On the other hand, modulo having more complex formulae, the 
results below can be easily extended to the case when these variables take 
an arbitrary finite number of values. 

In our binary variable model we are naturally led to an interacting in- 
tensity model, where we have to specify the intensities or rates (inverse of 
the average waiting times) at which the transitions crj h- > —ai and t— > — a>j 
take place. If we neglect direct interactions between the Wj's, and we make 
the mean-field assumption that the interaction between different firms only 
depends on the value of the global financial health indicator 

1 N 

i=l 

we are led to consider intensities of the form 

ai i— > —ai with intensity a(ai,uji,mj 1 ), 

(1) 

u>i i— > —u)i with intensity b(ai,Ui,mjf), 

where a(-,-,-) and &(-,-,•) are given functions. Since both financial health 
and distress tend to propagate, we assume that a{— l,Ui,mjr) is increasing 
in both uji and mjj, and a(l,uji,mjj) is decreasing. Similarly, b(ai,—l,mj f ) 
and b(ai,l,mjj) should be respectively increasing and decreasing in their 
variables. 

The next simplifying assumption is that the intensity a(ai,uii,m^) is 
actually independent of mj^, that is, of the form a(ai,uii). Although this 
assumption amounts to a rather mild computational simplification, it allows 
to show that aggregate behavior (phase transition, etc.) may occur even in 
absence of a direct interaction between rating indicators. 

Although a model of this generality could be fully analyzed, we make the 
following choice of the intensities, inspired by spin-glass systems, to make 
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the model depend on only a few parameters: 

(Ti i — ► — <Tj with intensity e~^' 7iU ' i , 

(2) 

w.j i — ► -Wj with intensity e" 7a;i m ^ . 

Here /? and 7 are positive parameters which indicate the strength of the 
corresponding interaction. Put differently, we are considering a continuous- 
time Markov chain on { — 1,1} 2N with the following infinitesimal generator: 

N N 

(3) Lf(a,u) = ]T e-^Vff&u) + £ e"^ m - V$7(<r, w), 
i=i j=i 

where Vff(a,u) = f(a l ,ui) — f(a,ui) (analogously for Vf ), and where the 
jth component of cr* is 

-di, for j = i. 

The rest of the paper is devoted to a detailed analysis of the above model. 
We conclude this subsection with some general remarks on the model we 
have just defined. 

Remark 2.1. 

• We have viewed the variable a as a rating class indicator. Contrary to 
the standard models for rating class transitions, our rating indicator a 
is not Markov by itself, but it is Markov only if paired with uj. This 
property is in line with empirical data and with recent research in the 
field of credit migration models. It is in fact well documented that real 
data of credit migration between rating classes exhibit a "non-Markovian" 
behavior. For a discussion on this topic see, for example, Christensen et 
al. [6]. In that paper the authors propose a hidden Markov process to 
model credit migration. The basic criticism to Markovianity is the fact 
that the probability of being downgraded is higher for firms that have 
been just downgraded. In order to capture this issue, the authors consider 
an "excited" rating state (e.g., B* from which there is a higher probability 
to be downgraded compared to the standard state B). This point of view is 
not far from ours, even though the mechanism of the transition is different. 
The downgrade to a = — 1 is higher when {a = l,u = —1) compared to 
(<7 = l,u; = l). 

• In our model, unlike other rating class models, we do not introduce a de- 
fault state for firms; it could be identified as a value for the pair (a, to) for 
which the corresponding intensities are identically zero, that is, a(<7, uj, mjr) 
b(a,uj,mjr) = for all values of m^. This would have the effect of intro- 
ducing a "trap state" for the system, changing drastically the long-time 
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behavior. Even in case of defaultable firms, however, our model could be 
meaningful up to a time-scale in which the fraction of defaulted firms is 
small. 

• With a choice of the intensities as in (2) we introduce a form of symmetry 
in our model, whereby the values a = — 1 and a = +1 for the rating indica- 
tor turn out to be equally likely. One could, however, modify the model in 
order to make the value a = — 1 less (more) likely than the value a = +1 
and this could, for example, be achieved by letting the intensity for loi 
be of the form e"^" 1 ^- 1 , where is an increasing, nonlinear and noneven 
function. A possible "prototype" choice would be (p(x) = j(x — K) + + 5 
with 7,5 > and K G (0,1). Note that with this latter choice we have 
4> > so that the value uji = +1 (and hence also <7j = +1) becomes more 
likely. Such an asymmetric setup might be more realistic in financial appli- 
cations but, besides leading to more complicated derivations, it depends 
also on the specific application at hand. Since, as already mentioned, we 
want to study a model that is as simple as possible and yet capable of 
producing the basic features of interest, in this paper we concentrate on 
the "symmetric choice" in (2). The large deviation approach to the Law 
of Large Numbers developed in Sections 3.1 and 3.2 can be adapted to 
the asymmetric setup (see Remark 3.5) with no essential difference. On 
the other hand, our proof of the Central Limit Theorem in Section 3.3 
may require more regularity on the function (f) above. We leave this point 
for further investigation. 

2.2. Invariant measures and nonreversibility. Mean-field models as the 
one we propose in this paper have already appeared, mostly in the statistical 
mechanics literature (see in particular [12] and [8], from which we borrow 
many of the mathematical tools). However, unlike what happens for the 
models in the cited references, we now show that our model is nonreversible. 
This implies that an explicit formula for the stationary distribution and its 
N — > oo asymptotics is not available. It is thus appropriate to follow a more 
specifically dynamic approach to understand the long-time behavior of the 
system. As already mentioned, we shall thus first study the N — > oo limit of 
the dynamics of the system, obtaining limit evolution equations. Then we 
study the equilibria of these equations. This is not necessarily equivalent to 
studying the N — > oo properties of the stationary distribution fijy. However, 
as we shall show later in this paper, this provides rather sharp information 
on how the system behaves for t and N large. 

The operator L given in (3) defines an irreducible, finite-state Markov 
chain. It follows that the process admits a unique stationary distribution 
/iTV, that is, a distribution such that, for each function / on the configuration 
space of (a,uj_), 



(4) 
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This distribution reflects the long-time behavior of the system, in the 
sense that, for each / and any initial distribution, 

lim E[f(a(t),u(t))] = y2vN(°L,u)f(a,u). 

t— >+oo z — ' 

<r,o; 

The stationarity condition (4) is equivalent to 

N 

i=i 

(5) 

N 

+ Y,^n(<L, J)e^ m N - fi N (a,ui)e-^ m N] = 

i=l 

for every a,u G {—1, 1}^. 

Simpler sufficient conditions for stationarity are the so-called detailed bal- 
ance conditions. We say that a probability v on {-1,1} 2N satisfies the de- 
tailed balance condition for the generator L if 

w)e /?<T4t * = v{g_,ui)e-^ 1 and 

(6) 

v(a,tJ)ei" irr %i = v(a,ui)e-^ m N 

for every a,u. When the detailed balance conditions (6) hold, we say the sys- 
tem is reversible: the stationary Markov chain with generator L and marginal 
law v has a distribution which is left invariant by time-reversal. In the case 
(6) admits a solution, they usually allow to derive the stationary distribution 
explicitly. This is not the case in our model. We have in fact: 

Proposition 2.2. The detailed balance equations (6) admit no solution, 
except at most for one specific value of N . 

Proof. By way of contradiction, assume a solution v of (6) exists. Then 
one easily obtains 

which implies 

V?Vf log v(a,u) = 4/30-m, 
V°Vflogv(a,ui)=4N- 1 1 u Jl o- l . 
This is not possible since VfVf\ogu(a,u) = V?V?logv(a,uf). □ 
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3. Main results: law of large numbers and Central Limit Theorem. In 

this section we state the results concerning the dynamics of the system 
(ai[0,T],oji[0,T])^L 1 in the limit as N — > oo. Note that for each value of 
N we are considering a Markov process with generator (3). Thus, it would 

be more accurate to denote by (<7 t - [0, T], [0, T]) the trajectories of the 
variables related to the ith. firm in the system with N firms. For convenience, 
we consider a fixed probability space (O,^ 7 , P) where all V([0, T])- valued 

processes a^ N \o,T], u!> N '[Q,T] are defined, and the following conditions are 
satisfied: 

• for each N > 1 the processes (c J - A ^[0,T],w J - A ^[0,T])^ 1 are Markov pro- 
cesses with infinitesimal generator (3); 

• for each N > 1 the { — 1, l} 2 -valued random variables (cr^ N \o) , LO^ N \o))fL 1 
are independent and identically distributed with an assigned law A. 

This last assumption on the initial distribution is stronger than what we 
actually need to prove the results below; however, it allows to avoid some 
technical aspects in the proof, that we consider not essential for the purposes 
of the paper. The other point, concerning the fact of realizing all processes 
in the same probability space, is not a restriction; we are not making any 
assumption on the dependence of processes with different values of N, so 
this joint realization is always possible. Its main purpose is to allow to state 
a strong law of large numbers. 

Our approach proceeds according to the following three steps, to which 
correspond the three subsections below, namely: 

(i) look for the limit dynamics of the system (N — > oo); 

(ii) study the equilibria of the limiting dynamics; 

(iii) describe the "finite volume approximations" (for large but finite N) 
via a central limit- type result. 

3.1. Deterministic limit: large deviations and law of large numbers. In 
what follows 2?([0,T]) denotes the space of right-continuous, piecewise con- 
stant functions [0,T] — > { — 1,1}, endowed with the Skorohod topology (see 
[17]). Let (cr i [0,r],a;i[0,T])^ 1 G V([0,T}) 2N denote a path of the process in 
the time-interval [0, T] for a generic T > 0. If f{pi [0, T] , u>i [0, T] ) is a function 
of the trajectory of the variables related to a single firm, one is interested in 
the asymptotic behavior of empirical averages of the form 




where pn is the sequence of empirical measures 




i=l 
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We may think of pat as a (random) element of j\Ai(T>([0, T]) x V([0,T])), 
the space of probability measures on X>([0,T]) x 2?([0,T]) endowed with the 
weak convergence topology. 

Our first aim is to determine the limit of / f dpjy as N — > oo, for / con- 
tinuous and bounded; in other words we look for the weak limit lim^ Pn 
in Mi(T>([0, T]) x X>([0,T])). This corresponds to a law of large numbers 
with the limit being a deterministic measure. This limit, being an element 
of A4i(V([0,T]) x V([0,T])), can be viewed as a stochastic process, and rep- 
resents the dynamics of the system in the limit N — > oo. The fluctuations of 
Pn around this deterministic limit will be studied in Section 3.3 below, and 
this turns out to be particularly relevant in the risk analysis of a portfolio 
(Section 4). 

The result we actually prove is a large deviation principle, which is much 
stronger than a law of large numbers. We start with some preliminary no- 
tions letting, in what follows, W G Mi(V([0,T]) x D([0,T])) denote the law 
of the {— 1, l} 2 -valued process (a(t),u;(t)) such that (c(0), w(0)) has distri- 
bution A, and both <r(-) and u>(-) change sign with constant intensity 1. For 
Q e Mi(p([0,T)) xP([0,T])) let 

H(Q\W) := ( / ^ ^ if Q « W and W G Ll(g) ' 

[ +oo, otherwise, 

denote the relative entropy between Q and W . Moreover, TltQ denotes the 
marginal law of Q at time t, and 



7i Q := 7 J aU t Q(da,dr). 



For a given path (<r[0, T], u[0, T)) G T>([0,T\) x V([0,T}), let iVf (resp. N? 
be the process counting the jumps of cr(-) [resp. uj(-)]. Define 



HQ) 

(7) 

whenever 



T (l _ e -M*M*) ) dt + /" T (! _ e -^(t)7 t Q ) dt 

j 

+ (3 [ T ' a(t)u(t-)dN? + / T cj(t) 7 , Q diV? 
Jo JO 



dQ, 



J(NZ + N£)dQ<+oc, 

and = otherwise. Finally let 

HQ) :=H(Q\W)-F(Q). 

We remark that, if / (N% + N%) dQ = +oo, then H(Q\W) = +oo (this will 
be shown in Section 5, Lemma 5.4) and thus also I(Q) = +oo. 
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Proposition 3.1. For each Q e Mi(V([0,T]) x 2?([0,T])), I(Q) > 0, 
and /(•) is a lower-semicontinuous function with compact level-sets [i.e., for 
each k > one has that {Q : I(Q) < k} is compact in the weak topology]. 
Moreover, for A,C C Aii(V([0, T]) x X?([0,T])) respectively open and closed 
for the weak topology, we have 

(8) liminf llogP( PiV £A)> - mf/CQ), 

(9) limsupi-logPGojv G C) < - inf J(Q). 

T/iis means that the distributions of obey a large deviation principle 
(LDP) with rate function /(•) (see, e.g., /i5/ /or the definition and funda- 
mental facts on LDP). 

The proof of Proposition 3.1 is given in Section 5 and follows from ar- 
guments similar to those in [12]. Various technical difficulties are due to 
unboundedness and noncontinuity of F, which are related to the nonre- 
versibility of the model. 

The key step to derive a law of large numbers from Proposition 3.1 is 
given in the following result, whose proof is also given in Section 5. In what 
follows, for q € M\({ — 1, l} 2 ) a probability on { — 1, l} 2 , we define 

m q'-= cr '?( cr ' u; )' 

(T,CJ=±1 

that can be interpreted as the expected rating under q. 



Proposition 3.2. The equation I(Q) = has a unique solution Q* . 
Moreover, if qt G M\{{ — 1, l} 2 ) denotes the marginal distribution of Q* at 
time t, then q t is the unique solution of the nonlinear (McKean-Vlasov) 
equation 

^ = £qu te[0,T], 

(10) 

Qo = A, 

where 

(11) Cq(a,co) = V'ie-^qfau)] + V w [e"^ m ^(o-,o;)] 

with (a,uj) G {-1,1} 2 . 



From Propositions 3.1 and 3.2, it is easy to derive the following strong 
law of large numbers. 
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Theorem 3.3. Let Q* e Mi(V([0,T]) x V([0,T])) be the probability 
given in Proposition 3.2. Then 

Pn Q* almost surely 

in the weak topology. 

Proof. Let Q* be the unique zero of the rate function /(•) as given by 
Proposition 3.2. Let Bq* be an arbitrary open neighborhood of Q* in the 
weak topology. By the upper bound in Proposition 3.1, we have 

limsup — \ogP{p N £B Q *)<- inf I(Q) < 0, 

N ^ Q£B Q * 

where the last inequality comes from lower semicontinuity of /(•), com- 
pactness of its level sets and the fact that L(Q) > for every Q ^ Q* . In- 
deed, if infQ^£ Q „ I{Q) = ; then there exists a sequence Q n ^ Bq* such that 
I{Qn) —> 0. By the compactness of the level sets there exists then a sub- 
sequence Q nk — > Q ^ Bq* . By lower semicontinuity it then follows I(Q) < 
liminf I(Q nk ) = which contradicts I(Q) > for q^Q* ■ By the above in- 
equality we thus have that P(pn ^ Bq-*) decays to exponentially fast. By 
a standard application of the Borel-Cantelli lemma, we obtain that p n — ► Q* 
almost surely. □ 

3.2. Equilibria of the limiting dynamics: phase transition. Equation (10) 
describes the dynamics of the system with generator (3) in the limit as N — > 
+oo. In this section we determine the equilibrium points, or stationary (in t) 
solutions of (10), that is, solutions of Cqt = and, more generally, the large 
time behavior of its solutions. First of all, it is convenient to reparametrize 
the unknown qt in (10). 

Let q be a probability on { — 1,1} . Note that each /:{ — 1,1} 2 — > M can 
be written in the form f(o~,cu) = aa + buj + caui + d. It follows that q is 
completely identified by the expectations 

m l-= Y ^(^w), 
(12) m£:= Yl uq{*,u), 

m T := crujq(a,uj). 

In particular, if q = qt, the marginal of Q* appearing in Proposition 3.2, then 
we write for m a , and similarly for mf,mf u '. In order to rewrite (10) in 
terms of the new variables mf ,m^,m^ w , observe that 

rh° '= Y o-q t (a,uj)= Y 
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On the other hand, a straightforward computation shows that, for every 
probability q, 

crCq = 2smh(f3)m^ -2cosh(f3)m°, 



giving 



2sinh(/3)r7# - 2cosh(/3)m ,T 



t ■ 



By making similar computations for m",m^, it is shown that (10) can be 
rewritten in the following form: 

ml = 2sinh(/3)m^ - 2cosh(^)m^, 
(13) mf = 2sinh(7m^) — 2 cosh (7777.^) m^ 1 , 



rh™ = 2sinh(/3) + 2sinh(7mf)m£ - 2(cosh(/3) + cosh^rr^))"^, 



with initial condition mQ = , = m^f , ttiq = m% . Note that m% u does 
not appear in the first and in the second equation in (13); this means that 
the differential system (13) is essentially two-dimensional: first one solves 
the two-dimensional system (on [— 1, l] 2 ) 

(14) (rh^,rht) = V(mt,mt), 

with y(x,y) = (2sinh(/5)?/ — 2cosh(/3)x,2sinh(7x) — 2ycosh(7x)), and then 
one solves the third equation in (13), which is linear in . Note also 
that to any (m£,m£) satisfying V{m% , m*) = 0, there corresponds a unique 

m™ := S1 ^osQi3)+cosh(^) ^ sucn that ( m * ' m * ' m * U ) is an equilibrium (stable 
solution) of (13). Moreover, if m" — > m% as t — > +00, then mf u — ► m% u . Thus, 
to discuss the equilibria of (13) and their stability, it is enough to analyze 
(14) and for this we have the following proposition, where by "linearly stable 
equilibrium" we mean a pair (x,y) such that V(x,y) = 0, and the linearized 
system (x,y) = DV(x,y)(x — x,y — y) is stable, that is, the eigenvalues of 
the Jacobian matrix DV(x,y) have all negative real parts. 

Theorem 3.4. (i) Suppose 7 < tan ^) ■ Then (14) has (0,0) as a unique 
equilibrium solution, which is globally asymptotically stable, that is, for every 
initial condition (mQ,mg), we have 

lim «,<) = (0,0). 

t— >+oo 

(ii) For 7 < tan ^^ the equilibrium (0, 0) is linearly stable. For 7 = tax ^^ 
the linearized system has a neutral direction, that is, DV(0, 0) has one zero 
eigenvalue. 
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(iii) For 7 > tan ^) the point (0,0) is still an equilibrium for (14), but 
it is a saddle point for the linearized system, that is, the matrix DV(0,0) 
has two nonzero real eigenvalues of opposite sign. Moreover (14) has two 
linearly stable solutions (m%,m%), (—m*,—m^), where m% is the unique 
strictly positive solution of the equation 

(15) x = tanh(/3) tanh(7x), 
and 

(16) < 



tanh(/3) * ' 

(iv) For 7 > j^grgj , the phase space [— 1, l] 2 is bipartitioned by a smooth 

curve r containing (0,0) such that [—1,1] \ T is the union of two disjoint 
sets r + ,r~ that are open in the induced topology of [— 1, l] 2 . Moreover 




lim (m^mf) = { (-<,-<), if(m^m^)eT 

t — >+oo 



Proof. See Section 5. □ 



i/«,mft)er+, 



Remark 3.5. The results in this section are specific to our model with 
the symmetry properties as induced by the specification of the intensities 
in (2). With an asymmetric setup such as described in Remark 2.1, (15) 
becomes 

x = tanh(/3) tanh(c/>(x)) 

thus allowing more flexibility in the position of the equilibria. In particular, 
by letting <j)(x) = j(x — K) + + 5, while still having three equilibria, we may 
choose their relative position by suitably choosing the values for 7, K, S. 
Notice that in this way we also increase the number of parameters in our 
model. 



3.3. Analysis of fluctuations: Central Limit Theorem. Having established 
a law of large numbers pn — > Q* , it is natural to analyze fluctuations around 
the limit, that is, the rate at which pn converges to Q* and the asymptotic 
distribution of pn — Q* ■ 

To study the asymptotic distribution of pn — Q* there are at least the 
following two possible approaches: 

(i) An approach based on a functional central limit theorem using a 
result in [2] that relates large deviations with the Central Limit Theorem 
(see [35], Chapter 3, for some results in this direction). 
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(ii) A weak convergence-type approach based on uniform convergence of 
the generators (see [17]). 

In this paper we shall follow an approach of the second type; more pre- 
cisely we shall provide a dynamical interpretation of the law of large numbers 
discussed in Theorem 3.3. Let ip : {— 1, l} 2 — > R, and define pN(t) by 

1 N 

J ipdp N (t) :=jj22i/>(<ri(t),Wi(t)). 

i=l 

In other words, pN(t) is the marginal of pjy at time t and we also have 
nvjy(t) = m p JV (t)- Note that, for each fixed t, pjv(i) is a probability on {—1, l} 2 , 
and so, by the considerations leading to (12), it can be viewed as a three- 
dimensional object. Thus {pN(t))te[o,T\ is a three-dimensional flow. A simple 
consequence of Theorem 3.3 is the following convergence of flows: 

(17) (PN(t))te[o,T\ -> (q t )te[o,T\ a.s., 

where the convergence of flows is meant in the uniform topology. Since the 
flow of marginals contains less information than the full measure of paths, 
the law of large numbers in (17) is weaker than the one in Theorem 3.3. 
However, the corresponding fluctuation flow 

(VN(p N (t) -qt))te[o,T\ 

is also a finite-dimensional flow, and it allows for a very explicit characteriza- 
tion of the limiting distribution. The following theorem gives the asymptotic 
behavior of this fluctuation flow; its proof is given in Section 5. 

Theorem 3.6. Consider the following three-dimensional fluctuation pro- 
cess: 



x N (t) := 1 




-ml). 


VN(t) ■= • 




-o 


z N (t) := ' 







Then (xN(t),yN(t), ZN(t)) converges as N— >oo, in the sense of weak con- 
vergence of stochastic processes, to a limiting three-dimensional Gaussian 
process (x(t),y(i),z(t)) which is the unique solution of the following linear 
stochastic differential equation: 



fdx(t)\ /x(t)\ fdB^m 

(18) dy(t) =A(t) y(t) dt + D(t) dB 2 (t) 

\dz(t)J \z(t)J \dB 3 (t)J 



18 



DAI PRA, RUNGGALDIER, SARTORI AND TOLOTTI 



where B\,B2,B^ are independent, standard Brownian motions, 

{ - cosh(/3) 

A(t) = 2 —7777^ sinh(7mf ) + 7 cosh(7mf) 

y sinh(7m^") + 7777^ cosh(7m^") — -fmf^ sinh^mf ) 

sinh(/3) \ 

— cosh(7m^) 

— (cosh(/3) + cosh(7mf ))y 

D(t)D*(t) 



I -mf sinh(/3) + cosh(/3) 
= — mf sinh (7777^) + cosh (7777^) 

\ —777^ sinh(/3) + mf cosh(/3) 777^ cosh(7m^") — sinh(7m^) 

—777,^" sinh(/3) + cosh(/3) \ 
777^ cosh(7777^) — sinh^m^) 
—m™ sinh(/3) + cosh(/3) — sinh(7mf ) + cosh^mf) J 

and (x(0),y(0), 2(0)) have a centered Gaussian distribution with covariance 
matrix 



(19) 



1 - 


K) 2 




— 777^777^ 


777- 


mf - 


- 777^777^ 


1 - 


(t<) 2 


777^ 


777^ - 




777^ — 


777^777^ 


1 - 



777 



l A " l A 

<ru;\2 



A 



Theorem 3.6 guarantees that, for each i > 0, the distribution of (xj\[(t), 
yjv(t), zw(t)) is asymptotically Gaussian, and provides a method to compute 
the limiting covariance matrix. Indeed, denote by Ej the covariance matrix 
of (x(t),y(t),z(t)). A simple application of Ito's rule to (18) shows that T, t 
solves the Lyapunov equation 



(20) 



dt 



A(t)Z t + Z t A(t)* + D{t)D*(t). 



In order to solve (20), it is convenient to interpret E as a vector in R 3x3 = 
R 3 (g)lR 3 . To avoid ambiguities, for a 3 x 3 matrix C we write vec(C) whenever 
we interpret it as a vector. It is easy to check that (20) can be rewritten as 
follows 



(21) 



d(vec(E f )) 
dt 



(A(t) ® I + I ® A(t)) vec(E 4 ) + vec(D(t)D* (t)), 



where "(8)" denotes the tensor product of matrices. Equation (21) is linear, 
so its solution can be given an explicit expression and can be computed after 
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having solved (13). More importantly, the behavior of Ej for large t can be 
obtained explicitly as follows. 

A. Case 7 < ^j^y- In this case we have shown in Theorem 3.4 that the 
solution (m% ,mf of (13) converges to (0, 0,tanh(/3)) as t — > +00. 
In particular, one immediately obtains the limits 

(22) A:= lim A(t), DD* := lim D(t)D*(t). 

A direct inspection (see the Appendix) shows that A has three real strictly 
negative eigenvalues. Moreover, the eigenvalues of the matrix A x I + 
I x A are all of the form Aj + Xj where Aj and Xj are eigenvalues of 
A, and therefore they are all strictly negative. It follows from (21) that 
lim^+oo T,t = E where 

(23) vec(E) = -(A ® I + 1 <g> A)" 1 vec (£>£>*). 

B. Case 7 > tan ^) • Also in this case, by Theorem 3.4, the limit 



lim (m^ ,mf ,m 

t— >+oo 



exists. Disregarding the exceptional case in which the initial condition of 
(13) belongs to the stable manifold T introduced in Theorem 3.4(iv), the 
limit above equals either (m* , m%, m£ w ), or (— mj, —m^,m1 u ), depending 
on the initial condition, where (mj,m^,mj^) are obtained by Theorem 
3.4(iii). In both cases one obtains as in (22) the limits A and DD*, and 
we show in the Appendix that also in this case the eigenvalues of A are 
real and strictly negative, so that lim^+oo = S is obtained as in (23). 
C. Case 7 = ^5Ep) 1 ^ n ^ s 

shown in the Appendix, the limiting 
matrix A is singular; it follows that the limit lim^+co does not exist, 
as one eigenvalue of Et grows polynomially in t. This means that, for 
critical values of the parameters, the size of normal fluctuations around 
the deterministic limit grows in time. Similarly to what is done in [8] 
for reversible models, it is possible to determine the critical long-time 
behavior of the fluctuation by a suitable space-time scaling in the model, 
giving rise to nonnormal fluctuations. More precisely, one can show the 
following convergence in distribution: 

iV 1/4 {m pN (VNt) - m (VNt)) Z 

where Z is non-Gaussian. This result is contained in [32]. 

We now state an immediate corollary of Theorem 3.6 concerning the fluc- 
tuations of the global health indicator; this will be used in the next section 
on large portfolio losses. 
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Corollary 3.7. As N — >■ oo we have that 



N H N a) - <1 

converges in law to a centered Gaussian random variable Z with variance 
(24) V(t) = V n (t), 

where solves (20) and ml solves (13). 



We conclude this section with the following: 



Remark 3.8. The evolution equation (20) for the covariance matrix 
is coupled with the McKean-Vlasov equation (13), and their joint behavior 
exhibits interesting aspects even before the system gets close to the stable 
fixed point. In particular, in the case 7 > tan ^^ , if the initial condition is 
sufficiently close to the stable manifold T, the system (13) spends some time 
close to the symmetric equilibrium (0, 0) before drifting to one of the stable 
equilibria. A closer look at (20) shows that when the system is close to 
the neutral equilibrium, the covariance matrix £ grows exponentially fast in 
time, causing sharp peaks in the variances. This is related to the credit crisis 
mentioned in the Introduction. A more detailed discussion on this point is 
given in the next section, in relation with applications to portfolio losses. 



4. Portfolio losses. We address now the problem of computing losses in 
a portfolio of positions issued by the N firms. A rather general modeling 
framework is to consider the total loss that a bank may suffer due to a 
risky portfolio at time t as a random variable defined by L N (t) = J2iLi{t). 
Different specifications for the single (marginal) losses Li(t) can be chosen 
accounting for heterogeneity, time dependence, interaction, macroeconomic 
factors and so on. A punctual treatment of this general modeling framework 
can be found in the book by McNeil, Frey and Embrechts [29]. For a com- 
parison with the most widely used industry examples of credit risk models 
see Frey and McNeil [21], Crouhy, Galai and Mark [11] or Gordy [25]. The 
same modeling insights are also developed in the most recent literature on 
risk management and large portfolio losses analysis; see [14, 19, 23, 26] for 
different specifications. 

In this paper we adopt the point of view of Giesecke and Weber [23]. The 
idea is to compute the aggregate losses as a sum of marginal losses Li(t), 
of which the distribution is supposed to depend on the realization of the 
variable Cj, that is, on the rating class. In particular, conditioned on the 
realization of a, the marginal losses will be assumed to be independent and 
identically distributed (the independence condition can be weakened; see 
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Example 4.4 below). More precisely, we assume given a suitable conditional 
distribution function G x , x £ { — 1,1}, namely 

(25) G x (u):=P(L i (t)<u\a i (t)=x) 
where the first and second moments are well defined, namely 

(26) h := E{Li{t)\ui{t) = 1) < E(Li(t)\<n(t) = -1) =: U 
and 

(27) Vl := Var(Li(t)\ai(t) = 1), u_i := Var(Li(t)\ai(t) = -1). 

The inequality in (26) specifies that we expect to lose more when in financial 
distress. 

The aggregate loss of a portfolio of volume N at time t is then defined as 

N 

L N (t) = ^2Ht). 

i=l 

We recall the definition of the global health indicators mj^(t) := i °i(^)> 
and m" := J adqt where qt solves the McKean-Vlasov equation [see (10)]. 

We also introduce a deterministic time function, which will be seen to 
represent an "asymptotic" loss when the number of firms goes to infinity, 
namely 

(28) L(t) = m t + . 

We state now the main result of this section. 

Theorem 4.1. Assume Li(t) has a distribution of the form (25). Then 
for t £ [0,T] with generic T > and for any value of the parameters [3 > 
and 7 > 0, we have 

- N 



in distribution, where L(t) has been defined in (28) and 

(29) y {t) = (h-l-i) 2 V(t) + (1+m^i + (1-m^-i 

with V(t) as defined in (24)- 



Proof. See Section 5. □ 
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Remark 4.2. The Gaussian approximation in Theorem 4.1 leads in 
particular to 

(30) P{L»<f)>a)~x(!!I£LS). 

By the symmetry of the model, the above Gaussian approximation for the 
losses is appropriate for a wide (depending on N) range of values of a. If we 
modify the model to become asymmetric as discussed in Remark 2.1 and, 
more precisely, we modify it so that a = — 1 becomes much less likely than 
a = +1, then for a "realistic" value of N, the number of firms with cjj = — 1 
could be too small for the Gaussian approximation to be sufficiently precise. 
One could then rather consider a Poisson-type approximation instead. 

We shall now provide examples illustrating possible specifications for the 
marginal loss distributions where, without loss of generality, we assume a 
unitary loss (e.g., loss due to a corporate bond) when a firm is in the bad 
state. 

We start with a very basic example where we assume that the marginal 
losses (when conditioned on the value of a) are deterministic. This means 
that the riskiness of the loss portfolio is related only to the number of firms 
in financial distress and so we can use directly the results of Section 3, in 
particular of Corollary 3.7. 



Example 4.3. Suppose that marginal losses are described as follows: 



1, if <T<(t) = -1, 

0, if<7i(t) = l. 



On the other hand 



i=l 

Recalling that mjj(t) = jfY^i a i(t), by Corollary 3.7 [see also (30)], we can 
compute various risk measures related to the portfolio losses such as the 
following Var-type measure: 

M- . /N-NmjjCt) \ { <r , s N -2a 
P(L N (t) >a)=P[ > a ) =P(m%(t) < 

(31) 



2 ~ V "ivw- N 



■2a + (l-mZ)N\ _ ^/ -2a + 2L°°(t)N\ 



where L°° (t) := lim^oo — ^ = lim^oo £ 



i 2N ~ 2 
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Loss probabilities varying the parameters 




i i i i i 

4400 4600 48O0 5000 5200 5400 5600 

Excess loss; x 



Fig. 1. Excess loss in a large portfolio (N — 10,000,) for different values of the parameters 
7 and j3 compared with the independence case. 

Looking at a portfolio of N = 10,000 small firms, we compute the excess 
loss probability for different values of the parameters f3, 7 comparing them 
with the benchmark case where there is no interaction at all, that is, where 
(3 = 7 = ("independence case"). In Figure 1 we show the cumulative prob- 
ability of having excess losses for the same portfolios. In this figure we see 
that, when the dependence increases, variance and risk measures increase as 
well. 

More general specifications are already suggested in the existing literature. 
For example, one could consider the losses to depend also on a random 
exogenous factor more precisely, the marginal losses Li(t) are independent 
and identically distributed conditionally to the realizations of the o"j(i)'s and 
of \E'. The conditional distributions 

G x (u) := P(Li(t) < u\ui(t) = x, *) 

are random variables, as well as the corresponding moments li,l-i,vi,V-\. 

In particular in the following example we apply our approach to a very 
tractable class of models, the "Bernoulli mixture models." This kind of mod- 
eling has been used in the context of cyclical correlations, that is in models 
where exogenous factors are supposed to characterize the evolution of the in- 
dicator of defaults (the classical factor models) . In the context of contagion- 
based models this class was first introduced by Giesecke and Weber in [23]. 
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Example 4.4 (Bernoulli mixture models). Assume that the marginal 
losses Li(t) are Bernoulli mixtures, that is, 

(o 9 \ j u\ — \ !> with probability P(<Ti(t), *), 

^ > \0, with probability 1-P((7i(t),*), 

where the mixing derives not only from the rating class indicator (Ji{t) of firm 
i, but also from an exogenous factor $£l p that represents macroeconomic 
variables that reflect the business cycle and thus allow for both contagion 
and cyclical effects on the rating probabilities. 

Notice that, with the above specification, the quantities defined in (26) 
and (27) now depend on the random factor that is, 

Zi = P(l,*), ui = P(l,*)(l-P(l,*)) and analogously for 

Consequently, the asymptotic loss function L(t) as well as the variance of the 
Gaussian approximation V{t) defined in (28) and (29) are also functions of 
^ . With a slight abuse of notation we shall write L^(t) [respectively V^(t)] 
for the asymptotic loss (variance) at time t given that = ip. 

Next we give a possible expression for the mixing distribution for P(cr, \P) 
that is in line with existing models on contagion. Let a and bi, i = 1,2, be 
nonnegative weight factors. Let us assume for simplicity that f 6 1 is a 
Gamma distributed random variable. Define then 

P(ff,¥) = l-expj-att-&ifc^ -fc}- 

This specification follows the CreditRisk+ modeling structure, even though 
in the standard industry examples direct contagion is not taken into account. 
Notice that the factor increases the probability of default for the firms 
in the bad rating class (cr = —1). Using (30) we have that 

P(L N (t) > a) « / M f NL ^)- a \ 

where is the density function of the Gamma random variable 

In Figure 2 we plot the excess loss probability in the case where a = 0.1, 
bi = 1, &2 = 0.5 and [5 = 1.5 is supposed to be fixed. We compare different 
specifications for ^ and 7. In particular we consider the following cases: 

^ = 4.5, 7 = 0.6; ^ = 4.5, 7 = 1.1; 

^~T(2.25,2), 7 = 1.1. 

The shape of the excess losses suggests that the loss may be sensibly higher 
in the case of high uncertainty about the value of the macroeconomic factor 
[\[r ~ T(2.25;2)] and in the case of high level of contagion (7 = 1.1). Notice 
that in all three situations we are in the subcritical case, since the critical 
value for 7 is 7 C = 1/ tanh(/3) ~ 1.105. This also implies that the equilibrium 
value is the same in the three situations and depends only on 
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Comparing excess losses varying the parameters 



0.2 







1 

V 

VV 
\\ • 
\\ 1 


- p=1.5, y=0.6. 4'=4.5 

(3=1.5, T=1.1. 4*=4.5 

p=1.5, 7=1.1, H'-r(2.25,2) 






\\ - 
\ - 






\ 
>Y 






1 V 
i V 






> \n 
\ - 







3000 3500 4000 4500 5000 

Excess loss: x 



Fig. 2. Loss amount in a large portfolio (N = 10,000,) in the case of marginal losses 
which (depending on the rating class) are distributed as Bernoulli random variables for 
which the parameter depends on Vt. 



Remark 4.5. Notice that the asymptotic loss distribution in the above 
Bernoulli mixture model does not only depend on a mixing parameter as 
in standard Bernoulli mixture models but, via L(t), it depends also on the 
value ml of the asymptotic average global health indicator. Moreover, com- 
pared to Giesecke and Weber [23], we are able to quantify the time- varying 
fluctuations of the global indicator "m a pN (ty We shall see that this may sen- 
sibly influence the distribution of losses in particular when looking at two 
different time horizons T\ and T2 before and after a credit crisis. 

Remark 4.6. Further examples may be considered, in particular when 
the distribution of the marginal losses Li(t) depends on the entire past 
trajectory of the rating indicator <7j, taking, for example, into account how 
long the firm has been in the bad state. Instead of depending simply on 
<jj(£), the distribution of Lj(i) could then be made dependent on Si(t) := 

f*m 1 with 5 G (0, 1), which is equal to 1 if firm i has spent 

a fraction 5 of time in the bad state. Corresponding to (32) we would then 
have 

, s _ f 1, with probability P(Si(t), ^), 
i{ ) ~ { 0, with probability 1 - P(Si(t), *). 

This model is not a straightforward extension of Example 4.4. In fact the 
theory developed above, in particular the Central Limit Theorem result in 
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Section 3.3, does not appear to be strong enough to handle it. For this 
purpose an approach based on a functional central limit theorem that was 
alluded to at the beginning of Section 3.3 would be more appropriate. This, 
however, goes beyond the scope of the present paper. 

Let us point out that in the examples above we have considered only the 
problem of computing large portfolio losses which led to examples where we 
computed (approximately) the quantiles P(L N (t) > a) where a is a (large) 
integer. From here, one could then compute the probability that the loss 

L N (t) 

ratio jf ' belongs to a given interval and this would then allow to compute 
(approximately) for our contagion model also other quantities in a risk- 
sensitive environment. In any case notice that Theorem 4.1 provides the 
entire asymptotic distribution for the portfolio losses. 

In the previous examples we have described large portfolio losses at a 
predetermined time horizon T for different specifications of the conditional 
loss distribution. In what follows, we shall describe in more detail how the 
phenomenon of a credit crisis may be explained in our setting and how this 
issue may influence the quantification of losses. This dynamic point of view 
on risk management that accounts for the possibility of a credit crisis in the 
market, is one of the main contributions of this work. 

As one could expect, the possibility of having a credit crisis is related 
to the existence of particular conditions on the market, more precisely to 
certain levels of interaction between the obligors (i.e., the parameters /3 and 
7) and certain values of the state variables describing the rating classes and 
the fundamentals (i.e., a and 00). 

4.1. Simulation results. To illustrate the situation we shall now present 
some simulation results. We shall proceed along two steps: the first one 
relates more specifically to the particle system, the second to the portfolio 
losses. 

Step 1 (Domains of attraction). In Section 3.2 we have characterized all 
the equilibria of the system depending on the values of the parameters. In 
particular we have shown that for supercritical values, by which we mean 
7 > tan ^^ , there are two asymmetric equilibrium configurations in the space 
(m cr ,m UJ ) that, for our symmetric model, are symmetric to one another and 
are defined as (m£,m*) and (— mj, — m%). 

In particular, Theorem 3.4 allows to characterize their domains of at- 
traction, that is, the sets of initial conditions that lead the trajectory to 
one of the equilibria, and we shall denote them by r + and T~ . Numerical 
simulations provide diagrams as in Figure 3. 
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Domains of attraction of (m? .m?*) and (-mf,-ml*); p=1 , y varying 




rrf 



Fig. 3. Domains of attraction F + for (mj,m") and F for (— mj,— m») and 
i/ieir boundary F for (3 = 1 and varying 7. //ere t/ie critical value for 7 is 
7c := l/tanh(/3) ~ 1.313. 

.Step 2 (Credit crises). We show results from numerical simulations that 
detect the crises when the values of the parameters are supercritical and the 
initial conditions are "near" the boundary of the domains of attraction, that 
is, near T. Given the symmetry of our model, the behavior of the system will 
be perfectly symmetric when starting in either T + or T~, but the typical 
credit crisis corresponds to what happens in T~ , so that below we shall 
illustrate this latter case. The analysis in an asymmetric model would be 
analogous. 

In Figure 4 we have plotted a trajectory starting in (m^mj)) £ T~ but 
near the boundary. It can be seen that the path moves toward {m a ,m u ) = 
(0, 0) and then leaves it decaying to the stable equilibrium. 

Concerning the time evolution, we see in Figure 5 that, for an initial 
condition in T~ and near the boundary, the variable mf (the same would 
happen also with mf that for clarity is not plotted) is first attracted to the 
unstable value zero, around which it spends a long time before moving to 
the stable equilibrium value m°. This can be explained, in financial terms, 
as follows: 

Suppose that at the initial time the market conditions are such that 
(m a \m w ) are in T~ but close to the curve T. Then for a while the system 
moves close to the stable manifold T toward (0,0), until it gets "captured" 
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Phase diagram of a trajectory of (m°,m") 




Fig. 4. Domains of attraction F + for (m",m^) and T for (— m%, —m%) and phase dia- 
gram of (mf ,711") with initial conditions (mj.mj) = (0.6, —0.85) when f3 = 1 and 7 = 2.3 
[here -y c = l/tanh(/3) ~ 1.313/. 



by the unstable direction of the equilibrium point (0,0). Since the system 
configuration belongs to T~ , the new stable equilibrium that the system is 
attracted to is given by (— m°, — m^). 

This situation represents (in a stylized manner) what we intend as a 
credit crisis: the state (0,0) may be considered as a "credit bubble," the 
decay toward the stable equilibrium mimics a credit crisis (i.e., a crash in 
the credit market). 

As soon as the system moves away from (0,0), the uncertainty (volatility) 
increases quickly and the credit quality indicators move to the stable con- 
figuration changing completely the picture of the market (the speed of the 
convergence depends on the level of interaction). 

This situation is also well illustrated by the loss probability computed 
before and after the crisis (i.e., in certain time instants T\ and T2). In Figure 
6 we see the excess probability of suffering a loss larger than x for the 
case of Example 4.4 with an exogenous parameter ~ T(2.25;2). One can 
see that before the crisis both the expected loss and the variance may be 
underestimated as well as the corresponding risk measures. Put differently, 
a model that does not distinguish between stable and unstable equilibria 
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Fig. 5. Trajectory of m% and V(t) with initial conditions = —0.5, mo = 0.395 when 
(3 — 1.5 and 7 = 2.1 [here 7 C = l/tanh(/3) ~ 1.105/. We have marked by (*) the time hori- 
zons T\ =2 and T2 = 10 before and after the crisis where in Figure 6 we shall compute the 
excess loss probabilities. 



(does not take credit crises into account) may underestimate the excess loss 
probability, since it does not recognize in the given situation the possibility 
of a sudden crash. 

Finally we mention the fact that for different levels of interaction we can 
distinguish between a smoothly varying business cycle and a crisis. When 
(3 and 7, the parameters describing the level of interaction, are sufficiently 
small, the business cycle (described in our simple model by the proportion of 
firms in the rating classes) evolves smoothly and the induced variance (level 
of uncertainty about the number of bad rated firms) is lower compared to 
the crisis case. In Figure 7 we show this fact for two levels of (5 and 7, both 
supercritical. 

5. Proofs. 

5.1. Proofs of Propositions 3.1 and 3.2. One of the main tools in this 
proof is the Girsanov formula for Markov chains. Since a Markov chain is a 
functional of the multivariate point process that counts the jumps between 
all pairs of states, this formula can be derived from the corresponding Gir- 
sanov formula for point processes (see, e.g., [3], Section 4.2). We state it here 
for completeness. 
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Excess loss probabilities before and after the crisis 
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Fig. 6. Excess probability of losses in a portfolio of N — 10,000 obligors, f3 = 1.5 and 
7 = 2.1 computed in T\ — 2 and Ti = 10, namely before and after the crisis in the case of 
Example 44 with * ~ T(2.25;2) [here 7c = l/tanh(/3) ~ 1.105/. 



Trajectories of rr^ and V(t) varying the parameters 
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Fig. 7. Trajectories of ml and V(t) for different levels of interaction, that is, letting j3 
and 7 vary. In the case of higher values we really see a crisis and a corresponding peak in 
the uncertainty in the market. In the case of smaller values the number of bad rated firms 
decreases smoothly to a new equilibrium, that is, toward a bad business cycle. The critical 
values for -y are, respectively, 1/ tanh(1.5) ~ 1.105 and 1/ tanh(0.9) ~ 1.396. 
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Proposition 5.1. Let S be a finite set, and (-X"(t))t e [o,T]> (X(*))te[o,T] 
two S-valued Markov chains with infinitesimal generators, respectively, 

Lf(x) = Y,L x , y [f(y)-f(x)], 
Mf(x) = Y,M x , y [f(y)-f(x)}. 

y^x 

Assume X(0) and Y(0) have the same distribution, and denote by Px and 
Py the law of the two processes on the appropriate set of trajectories in 
the time-interval [0, T]. Assume that whenever M XtV = also L XtV = 0. Then 
Px < Py, and 

g(*([0,T])) 



cxp 



/ E (M x(t)tV -L x( t )tV )dt+ I log j(t-)At) m 

JO y ^ x{t) Jo M x( t- )At) 



where x(t ) := lim s ^ x(s), log jj = 1 and Nt is the counting process that 
counts the jumps of the trajectory x([0,T]). 

In what follows we denote by Px the law on the path space of (ct[0,T], 
w[0,T]) G ('D([0,T])) 2N under the interacting dynamics, with initial condi- 
tions such that (^(O^wf ^))^ ! are independent and identically dis- 
tributed with an assigned law A (see beginning of Section 3). As in Section 
3.1 we let W £ Mi(V([0,T\) x V([0,T])) denote the law of the {-1,1} 2 - 
valued process (a(t),uj(t)) such that (cj(0),lj(0)) has distribution A, and 
both a(-) and uj(-) change sign with constant rate 1. By W® N we mean the 
product of N copies of W. We begin with some preliminary lemmas. 

Lemma 5.2. 

(33) (a[0,T], u[0,T\) = exp[NF(p N (a[0,T], u[0,T]))}, 

where F is the function defined in (7). 

Proof. Let (N^-(i))fLi be the multivariate counting process which counts 
the jumps of Oi for i = 1, . . . , N , and (Nf-(i))fL 1 be the multivariate counting 
process which counts the jumps of uji for % = 1, . . . ,N . Since each jump of the 
trajectory (cr[0,T], w[0,T]) is counted by exactly one of the above counting 
processes, Proposition 5.1 applied to this case yields 

dP N 



dW® N 



(a[0,T], u[0,T]) 
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yf (i_ e -/M*)"i(*) )c ft + v / i g e -to(*~> w '(*~)divf(i) 

N T 



N t 

+ 2_^f loge p jv ( * > dNf(i) 



i=l 



Since, with probability 1 with respect to W® N , there are no simultaneous 
jumps, we have 

TV rp /V rp 

£ / loge"^^^) diVf(i)=£ / -P{-ai(t))ui(t)dN?(i) 



and 



^ /-T /.-a £ W /-T 

ge w(« > dN-(i) = y 

i=i 



TV T 

Si "> 

i=l JU 







from which (33) follows easily after having observed that, VF®^ almost 
surely, 

y (^+iv^)dp 7V <+oo, 

and that simultaneous jumps of a and uj do not occur under dW m . □ 

The main problem in the proof of Proposition 3.1 is related to the fact 
that the function F in (7) is neither continuous nor bounded. The following 
technical lemmas have the purpose of circumventing this problem. In what 
follows, we let 



(34) 



T:=|qgMi(P[0,T] 2 ): j (JV? + JV£) dQ < +oo|. 



We first define, for r > and Q G X, 

rT 



Fr(Q) 







(35) 



+ / (/3ff(t)w(*-)-logr)diVf 



+ / -logr)dA£ 
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Note that F = F\. Moreover, Lemma 5.2 can be easily extended to show 
that 

dP N 



(36) 



(a[0, T] , u [0, T\) = exp [NF r (p N (a[0, T] , u [0, T] ))] , 



dW r m 

where W r is the law of the {—1, l} 2 -valued process a(t),co(t) such that (o"(0), 
tij(O)) has distribution A, and both cr(-) and w(-) change sign with constant 
rate r. 

Lemma 5.3. For < r < min(e - ^, e -7 ), F r is lower semicontinuous on 
Z. For r > max(e^,e 7 ), F r is upper semicontinuous. 

Proof. By definition of weak topology the fact that the map 



Mth? )dt 



dQ 



is continuous is rather straightforward (since Q-expectations of bounded 
continuous functions in 2?([0,T]) are continuous in Q). Thus we only have 
to deal with the term 

rT 



(37) 



{Pa(t)u(t-)- log r)dN? 



dQ 



+ 



Wt)#- log r)dNt 



dQ. 



We show that for < r < min(e - ^, e -7 ) the expression in (37) is lower semi- 
continuous in Q G Z. This shows that F r is lower semicontinuous. The case 
r > max(e^,e 7 ) is treated similarly. 

For e > consider the function cp £ : V[0, T] — > M. defined by 



<Pe(ri) :z 



if r](t) jumps for some t G (0, e], 
otherwise. 

Given r\ G V([0,T]) we define r/(s) for s > T by letting rj(s) = rj{T). Then, 
letting 9t denote the shift operator, we have that, for t G [0, T], 9trj is the 
element of £>([0, T]) given by 9tr](s) := r/(t + s). Consider now two functions 
f,g:{-l,l} 2 ^R, and define f £ , g £ : T>[0, T] 2 -^Rby 

/ £ (<7[ ,T],^[o,T]) :=inf{/(<T(t),w(t)):t G (0,e)}, 
and similarly for g £ . Then define 



®e(°~{o,T],U[o, T ]) J o f £ (0tcr,0tuj)(p £ (9ta)dt + g £ (9 t a,9 t u;)ip £ (9 t uj) dt. 

The key to the continuation of the proof below are the following two proper- 
ties of 3> e . These properties are essentially straightforward, and their proofs 
are omitted: 
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$ e is continuous and bounded on {(o-\ 0i t]i u [o,t]) '-^t + -^r < +°°} • 
Suppose f,g>0. Then, assuming 0"[o,t] > w [o,T] have a finite number of 
jumps, $ e (a"r 0! r]) w [o,T]) increases when e J, to 

[ T f(<r t - , w t - ) diVf + [ T g(v t - , <»t- ) dNf . 
Jo Jo 



Therefore by monotone convergence 



T f(a t - , u t - ) dN? + ( T g(a t - , ^- ) dN? 
o Jo 



dQ 



£>0 

In particular, the map 

rT 



sup / $ £ (crr 0)T ],a;r 0jT ])dQ. 

£>0 J 



/ / f(a t -,u t -)dN[+ [ g(a t -,u t -)dN? 
J Uo Jo 



dQ 



is lower semicontinuous on I. 



Now, for r < min(e @ ,e 7 ), the function f(a,u>) = —(3o~cu — logr is nonnega- 
tive. As for the function g, that should be — oj(t)^ — logr, we notice that it 
is not a function of (a,uj), but rather a function of (a, TLtQ), thus depending 
explicitly on t and Q. However, due to its boundedness and the fact that 
is continuous in Q uniformly in t, a, the argument above applies with minor 
modifications thus leading to the conclusion of the proof. □ 

Lemma 5.4. Let Q £ Mi{V([0,T}) 2 ) be such that H(Q\W) < +oo. Then 
Q El. The same result applies ifW r replaces W. 

Proof. By the entropy inequality (see (6.2.14) in [15]) 

J N£dQ< log J e N T dW + H(Q\W). 

But N£ has Poisson distribution under W, so / e N T dW < +oo. By applying 
the same argument to N^, the proof is completed. This proof extends with 
no modifications to the case r / 1. □ 

Lemma 5.5. The function 

I{Q):=H{Q\W)-F{Q) 
is lower semicontinuous on _Mi(£>[0,T] 2 ). 
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Proof. It is well known (see [15], Lemma 6.2.13) that the entropy 
H(Q\W) is lower semicontinuous in Q in all of Mi(T>([0,T]) 2 ). Moreover, 
by definition, F(Q) < +oo for every Q, and so we have H(Q\W) = I(Q) 
whenever H(Q\W) = +oo. Since, by Lemma 5.4, H(Q\W) = +oo for Q <£l, 
we are left to prove the following two statements: 

(i) I(Q) is lower semicontinuous in X. 

(ii) If H(Q\W) = +oo and Q n ^Q weakly, then I(Q n ) ->• +oo. 

The following key identity, which holds for r > 0, is a simple consequence 
of the definition of relative entropy and of the Girsanov formula for Markov 
chains. 

H(Q\W r ) = H(Q\W) + J log dQ 

(38) 

= H(Q\W) +2T(r - 1) + logr / (N^ + N^)dQ. 

In particular, by Lemma 5.4, we have that H{Q\W) < +oo <^=^ H(Q\W r ) < 
+oo. A simple consequence of (38) is then the following: 

(39) I(Q) = H(Q\W r )-F r (Q), 

where the difference in (39) is meant to be +oo whenever H(Q\W r ) = +oo 
[which is equivalent to H(Q\W) = +oo]. 

We are now ready to prove (i) and (ii) . To prove (i) it is enough to choose 
r > max(e^,e 7 ) and use Lemma 5.3. Moreover, for the same choice of r, the 
stochastic integrals in (35) are nonpositive, so F r {Q) < 2Tr. Therefore, if 
H{Q\W) = +oo and Q n ^Q, 

liminf I{Q n ) > liminf H{Q n \W r ) - 2Tr = +oo, 

where the last equality follows from lower semicontinuity of H{-\W r ) and 
H(Q\W r ) = +oo. Thus (ii) is proved. □ 

Lemma 5.6. The function I(Q) has compact level sets, that is, for every 
k > the set {Q : I(Q) < A;} is compact. 

Proof. Choosing, as above, r > max(e^,e 7 ), we have that F r (Q) < 2Tr 
for every Q. Thus, by (39), 

{Q : I(Q) < k} C {Q : H(Q\W r ) <k + 2Tr}. 

Since (see [15], Lemma 6.2.13) the relative entropy has compact level sets, 
{Q:I(Q) < k} is contained in a compact set. Moreover, by lower semiconti- 
nuity of /, {Q : I(Q) < k} is closed, and this completes the proof. □ 
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Lemma 5.7. For every r > there exists 5 > 1 such that 
limsup— log / exp[6NF r (pN)]dW? N < +oo. 

Proof. We give the proof for r = 1; the modifications for the general 
case are obvious. The proof consists of rather simple manipulations. The idea 
can be summarized as follows. If 8 = 1, then by Lemma 5.2, exp[5NF(pN)] 
is the Radon-Nikodym derivative of P/v with respect to W® N , and therefore 
has expectation 1. For 5 > 1, we write 8F(pn) = Fi(p^) + F2{pn) in such a 
way that F2 is bounded and exp[A r i ? i(p7v)] is a Radon-Nikodym derivative 
of a probability with respect to W® N . More specifically, observe that, using 
(7), 

N j> N t 

i=1 Jo i=1 Jo 

N rT 

+ W {8-8e-^ i{t)m ^)dt 

N rT 



i=i J0 

N T N T 

l=1 Jo i=1 Jo 

+ E/ {l-e- 5 ^ [t)m ^)dt + y SwQmUQdm 
i=i Jo i=i Jo 

N T 

i=i Jo 

N T 

+ W (5- 8e-^ i(t)m ^) - (1 - e -^W ro ^(*))) dt 
i=i ^° 

= NF 1 (p N ) + NF 2 ( PN ), 

where 

NF 1 (p N ):=J2 [ (l- e - 5 to® u '®)dt + T, / 60*^)^(1;) dfif(i) 

N x N r T 
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and 

NF 2 (p N ) -=y [ (S- Se-f***®"*® - (1 - e - spai ® Ui ®)) dt 

N j> 

Note that exp[./V\Fi(/9jv)] has the same form of exp[NF(pjy)] after having 
replaced (3 by 6/3. In particular, J exp[A r Fi(p7v)] dW® N = 1. Moreover, it is 
easy to see that 

F 2 (p N ) < T(25 - 6(e~P + e" 7 ) - 2 + e 5f3 + e 5 ^). 

Putting all together, we obtain 

J exp[5NF(p N )]dW m 

< exp[iVr(2<5 - 5{e~ p + e" 7 ) 

= exp[iVr(25 - <5(e^ /3 + e~ 7 ) 
from which the conclusion follows easily. □ 

Completing the proof of Proposition 3.1. It remains to show 
the upper and the lower bounds (9) and (8). We prove them separately; our 
main tool is the Varadhan Lemma in the version in [15], Lemmas 4.3.4 and 
4.3.6. 

We deal first with the upper bound (9). Take r > max(e^, e 7 ), so that the 
function F r in (35) is upper semicontinuous. Denote by 7-V the distribution 
of pn under P/v, and by Wat its distribution under W® N . By (36) 

(40) ^L {Q)=eMNFrm 

By Sanov's theorem (Theorem 6.2.10 in [15]), the sequence of probabilities 
Wn satisfies a large deviation principle with rate function H(Q\W r ). Since 
F r is upper semicontinuous and satisfies the super exponential estimate in 
Lemma 5.7, we can apply Lemma 4.3.6 in [15], together with identity (39), 
to obtain the upper bound (9). The lower bound (8) is proved similarly, 
by taking < r < min(e - ^, e~ 7 ), so that F r becomes lower semicontinuous, 
using (40) again and Lemma 4.3.4 in [15]. □ 

The remaining part of this section is devoted to the proof of Proposition 
3.2. It mainly consists in giving an alternative representation of the rate 
function I(Q). 



- 2 + e sp + e h )} J exp[NF!(p N )] dW 
-2 + e^ + e 57 )], 
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Let now Q G Mi{V([0,T\) x P([0,T])). We associate with Q the law of 
a time-inhomogeneous Markov process on {— 1,1} 2 which evolves according 
to the following rules: 



a — > —a 

with intensity e~ 

ijj — > —ijj 



with intensity exp 



-JU <jU t Q(a,T) 

<7,re{-l,l} 



-it 



and with initial distribution A. We denote by P® the law of this process. In 
other words, P® is the law of the Markov process on { — 1,1} 2 with initial 
distribution A and time-dependent generator 

£ ( /( ( i, W ) = e-^V7(a, W )+e- 7 ™MV w /(^). 

Lemma 5.8. For every Q G Mi(V(%T\) x V([0,T})) such that I(Q) < 
+00, we have 

I{Q)=H(Q\PQ). 

Proof. We begin by observing that, since by assumption I(Q) < 00, 
we have H(Q\W) < +00 and so by Lemma 5.4 it follows that Q GX, which 
implies that the integrals below are well defined. Using again Girsanov's 
formula for Markov chains in Proposition 5.1, we obtain 

r dP Q 
J log— (a[0,T]M0,T])dQ 

(1 _ e-M^W)^ + ( T (i - e -Mt)f^tQ(da,dr^ dt 



+ / {-Pa{r)uj{t-))dNZ 



+ 



aIl t -Q{da, dr) 



dm 



dQ 



(l_ e -M*M*))rf t+ f (i_ e -i"(t)J°n t Q(d*,dT)^ dt 



+ P I a(t)uj{t) dN? + 7 / u(t) 

T 











aIL t Q(do; dr) 



dm 



dQ 



1-e 



-p<r(t)u(t)< 







)dt+ / {l-e-"®^)dt 
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+ P ( T a{t)uo{t) dN? + [ T u(t)-f® dN? dQ 
Jo Jo 

= F{Q). 
Finally, just observe that 

dQ f dP® 



dW 



I(Q) = J dQ log^- J dQ log' 

= J d Q\og^ = H{Q\P^). □ 



Completing the proof of Proposition 3.2. By properness of the 
relative entropy [H(n\v) = \i = v\, from Lemma 5.8 we have that the 
equation I(Q) = is equivalent to Q = P® . Suppose Q* is a solution of this 
last equation. Then, in particular, qt := HtQ* = H-tP ■ The marginals of a 
Markov process are solutions of the corresponding forward equation that, 
in this case, leads to the fact that q t is a solution of (10). This differential 
equation, being an equation in finite dimension with locally Lipschitz coeffi- 
cients, has at most one solution in [0,T]. Since P® is totally determined by 
the flow qt, it follows that equation Q = P® has at most one solution. The 
existence of a solution follows from the fact that I(Q) is the rate function 
of a LDP, and therefore must have at least one zero, indeed, by (8) with 
A = Mi(D[0,T] x V[0,T]), we get MqI(Q) = 0. Since I is lower semicon- 
tinuous, this inf is actually a minimum. □ 



5.2. Proof of Theorem 3.4- We first observe that the square [— 1,1] 2 
is stable for the flow of (14), since the vector field V(x,y) points inward 
at the boundary of [— 1,1] 2 . It is also immediately seen that the equation 
V(x, y) = holds if and only if x = tanh(/3) tanh(7x) and y = tan h(/3) a: ' M° re ~ 
over a simple convexity argument shows that x = tanh(/5) tanh(7x) has x = 
as unique solution for 7 < tan j L ^^ , while for 7 > a strictly positive so- 

lution, and its opposite, bifurcate from the null solution. We have therefore 
found all equilibria of (14). 

We now remark that (14) has no cycles (periodic solutions). Indeed, sup- 
pose (xt,yt) is a cycle of period T. Then by the Divergence Theorem 

(41) 0< / [V 1 (x t ,yt)xt + V 2 (xt,yt)yt]dt= [ divV(x,y)dxdy, 
Jo Jc 

where V\ , V2 are the components of V and C is the open set enclosed by the 
cycle. But a simple direct computation shows that divV(x,y) < in all of 
[— 1,1] 2 , so that (41) cannot hold. 
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It follows by the Poincare-Bendixon theorem that every solution must 
converge to an equilibrium as t — > +00. This completes the proof of (i). The 
matrix of the linearized system is 

Dvm = (-2ca&W 2sinh(/^ 

from which also (ii) and (iii) are readily shown. It remains to show (iv). 
For 7 > tiSEra) ' we ^ Vs ^ e an eigenvector of the negative eigenvalue of 
W(0,0). By the Stable Manifold Theorem (see Section 2.7 in [30]), the 
set of initial conditions that are asymptotically driven to (0, 0) form a one- 
dimensional manifold T that is tangent to v s at (0,0). Since any solution 
converges to an equilibrium point, and solutions starting in T c cannot cross 
r (otherwise uniqueness would be violated), the remaining part of statement 
(iv) follows. 

5.3. Proof of Theorem 3.6. 

Proof. One key remark is the fact that the stochastic process ( m ^ N (ty 
m w l+ \,mZ^ n\) is a sufficient statistic for our model; in this context this 
means that its evolution is Markovian. This can be proved by checking that 
if we apply the generator L in (3) to a function of the form ( / 9 ( m p Ar (j)) m p JV (t); 
mZ 1 ^ n\ ) i then we obtain again a function of (m a „ M , U \ > r*\)- A long 

PN(t) n ° V PN(t)> PN(t) , PN(t)J o 

but straightforward computation actually gives 

M^ (t) ,m^ (t) ,m~ (t) ) = [lC N ip](m° N{t) ,m" pN{t) ,m™ {t) ), 

where 



_ N 
~ ~4 



-j E [j£ + kr) + jk9 + l] 



(42) 



(i,fc)e{-i,i} 2 



V>(t- jjj, v,o- jfjk) - fit* e ) 



This implies that KLm is the infinitesimal generator of the three-dimensional 
Markov process (m^ (t) ,m^ (t) ,m^ (t) ). Note now that (x N (t),y N (t), z N (t)) 
is obtained from ( m p JV (t)) m p iV (t)> m p^(t)) through a time dependent, linear 
invertible transformation. We call Tt this transformation, that is, 
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(the dependence on TV of T t is omitted in the notation). Therefore (xN(t),yN(t), 
zm(t)) is itself a (time-inhomogeneous) Markov process, whose infinitesimal 
generator Ti^t can be obtained from (42) as follows: 

H N , t f(x, y, z) = K N [fo T t ](irHx, y, z)) + ^[/ o T t ]{T^\x, y, z)). 

A simple computation gives then 
T<N,tf(x,y,z) 
_ N 

~ T 



E 



j—j= + k-j= + jk—= + jml + km? + jkm™ + 1 



(43) 



N 



N 
f[x- 



+ e 



N 



j,y,z 



f[ x,y 



jk ) - f(x,y,z) 
2 



N 



k, z 



jk 

'N 
-f(x,y,z) 



'NrnZMx^z) - VNm?f y (x,y,z) - V Nm?" f z (x,y, z), 

where f x stands for and similarly for the other derivatives. At this 
point we compute the asymptotics of Ti.N,tf{x,y,z) as iV — ► +oo, assum- 
ing /:R 3 — > R a C 3 function with compact support. First of all we make a 
Taylor expansion of terms like 



f x 



3,y,z 



(44) 

and 

(45) 



fx(x,y,z) 



jk - f(x,y,z) 



fz(x,y,z) 



2 2 4 / 1 \ 

+ jjfxx(x,y,z) + —fzz(x,y,z) + —f xz (x,y,z) + o(—j 



-~/(x/VN) 



1-7 



+ o 



Note that, since all derivatives of / are bounded, the remainder in (44) is 
o(jf) uniformly in (x,y,z) £ R 3 . Moreover, the remainder in (45) is °(^t^) 
uniformly for x in a compact set. Therefore, since / has compact support, 
when we use (44) and (45) to replace the corresponding terms in (43), we 
obtain remainders whose bounds are uniform in R 3 . When (44) and (45) are 
plugged into (43), all terms of order y/~N coming from the sum over (j,k) G 
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{— 1,1} 2 are canceled by the terms ^/Nm" f x (x,y,z) — y/Nrhf f y (x,y,z) — 
\[Nra1' JJ f z (x,y, z). It follows then by a straightforward computation that 

lim sup sup \H N ,tf{x,y,z)-H t f{x,y : z)\=0, 
N -*°°te[o,T]x,y,zem. 3 

where 

H t f(x, y, z) = 2{/,[-xcosh(/3) + y sinh(/3)] 

+ f y [— 'yxm'^ sinh(7m^) + 7rrcosh(7m^) — ycosh(7m^)] 
+ f z [ x sinh(7m^) + 7xm" cosh^m^) 

— 7sm™ sinh(7m^) — zcosh(/3) — zcosh(7m^)] 
+ /^[-mrsmh(/3) + cosh(/3)] 

(46) 

+ fyy[—mf sinh(7m^) + cosh(7m^)] 
+ / zz [-mrsinh(/3)+cosh(/3) 

— rrtf sinh(7m^) + cosh(7m^)] 
+ 2f xz [-m1 sinh(/3) + mf cosh(/?)] 

+ 2f yz [m1 cosh(7mf ) - m? u sinn^m^)]} 

is the infinitesimal generator of the linear diffusion process (18). Using Theo- 
rem 1.6.1 in [17], the proof is completed if we show that (xat(O), 2/tv(0), zjv(O)) 
converges as N — > +oo, in distribution to (x(0), t/(0), z(0)). This last state- 
ment follows by the standard Central Limit Theorem for i.i.d. random vari- 
ables; indeed, by assumption, (o"i(0), Wj(0)) are independent with law A, 
and (19) is just the covariance matrix under A of (er(0), w(0), <t(0)u;(0)). It 
should be pointed out that Theorem 1.6.1 in [17] does not deal explicitly 
with time-dependent generators, as is the case here. To fix this point it 
is enough to introduce an additional variable, r(t) := t, and consider the 
process a(t) := (x(t),y(t), z(t),T(t)), whose generator is time-homogeneous. 
This argument, together with the fact that the convergence of H.N,tf{x-, z) 
to 7i.tf(x,y,z) is uniform in both (x,y,z) and t, completes the proof. □ 

5.4. Proof of Theorem 4-1- We start with a technical lemma. 

Lemma 5.9. For t£ [0,T] we have the convergence in distribution 



where L(t) is defined in (28) and V(t) in (24)- 
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Proof. Define, for x £ {— 1, 1}, the quantity A% (t) as the number of <7j 
that, at a given time t, are equal to x. We may then write l+m N^ — 

and 1 m 2 N — = — Recall moreover that for iV — > oo, mj[(t) — > m£\ We 
then have 



^?(^W-L(t)) 



v ^/l+m^) +u l-m^) 



L(i) 

2 2 2 * 2 

*(*4^<n* w - ™n) - x ^(o, "'-'- 4 '^> ) , 

where the last convergence follows from Corollary 3.7 noticing that mj^(t) = 

PN(t) 

Proof of Theorem 4.1. We have to check that 

^(^LM - L(t)^Y ~ N(0,V(t)), 

where V{t) is defined in (29). 

Separating the firms according to whether their <7j(t) is +1 or —1, 

VN^ L / t] - = v^v( Ej: ' J - W=lLj(t) ^ : ' T - (t)= - lLj(t) - !(*)) . 
We then add and subtract Z CT . to obtain 

N 

Ej :CTj (f)=-i(^^)-^-i) Sj^-(t) _ Lf A 

iV N {) ' 



(47) 



Since we have only independence conditionally on a(t), we need to check 
whether the CLT still applies. Let us show the convergence of the corre- 
sponding characteristic functions: 



E 



cxp 



f L»(t)-NL(t) y 

r — — j 
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E 



(48) 



E 



exp < ir 



£i: ffj .( t )=l(£j(*)-*l) 



+ 



N 



+ 



N 



sit) 



The last of the three terms is measurable with respect to the sigma algebra 
generated by a(t) so that we can take it out from the inner expectation. Be- 
cause of the conditional independence we can separate the remaining terms 
in the product of conditional expectations: 



E 



exp 



p^M-iO } |g(t) 



x E 



exp< ir 



£ i:ffi ( t )=-i(£;(t)-U) 



N 



sit) 



By conditional independence, 



E 



exp< ir 



N 



sit) 



sit) 



1 Vlr , 

1 - -TT + O — 



2 AT 



N 



where the last equality follows because l\ and v\ are the first two conditional 
moments of Lj{t). 



Recalling that — ^ 
that 



converges almost surely to 



l+mf , 
— g- 5 - we nave 



lim 



1 Wl r , 



lim 

N^oo 



exp 



i;i r 2 A?(t) 
2 (t) iV 

r 2 1 + m? 



+ o 



Af(t) 



The same argument holds for the terms where (Jj{t) = — 1. Since 
— g- 4 -, we nave 



lim 

AT— >oo 



u-i r 2 Agift) 
2 A^(t) N 



+ o 



■ exp 



r 1 — mf 
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J2 1<j (t)-NL(t) 

Finally, recall from Lemma 5.9 that — - — l -t= converges to X ~ iV(0, 

so that 



lim E 

AT— >oc 



exp 



1v 



exp 



r 2 - Z_!) 2 ^(t) 



Thus, denoting by E[- ■ ■ \o~(t)] the inner conditional expectation in (48), we 
have shown that 



lim E[- ■ ■ \cr(t)] = exp 

N— >oo 



r 2 (h-U) 2 V(t) 



2 



exp 



r z 1 + ml 



-vi 



x exp 



r 1 — 



exp 



By the Dominated Convergence Theorem, taking the limit as N — > +oo in 
(48), we can interchange the limit with the outer expectation, and the proof 
is completed. □ 



6. Conclusions and possible extensions. In this paper we have described 
propagation of financial distress in a network of firms linked by business 
relationships. 

We have proposed a model for credit contagion, based on interacting par- 
ticle systems, and we have quantified the impact of contagion on the losses 
suffered by a financial institution holding a large portfolio with positions 
issued by the firms. 

Compared to the existing literature on credit contagion, we have proposed 
a dynamic model where it is possible to describe the evolution of the indica- 
tors of financial distress. In this way we are able to compute the distribution 
of the losses in a large portfolio for any time horizon T, via a suitable version 
of the central limit theorem. 

The peculiarity of our model is the fact that the changes in rating class 
(the a variables) are related to the degree of health of the system (the 
global indicator m u ). There is a further characteristic of the firms that is 
summarized by a second variable u (a liquidity indicator) and that describes 
the ability of the firm to act as a buffer against adverse news coming from 
the market. The evolution of the pair (o~,u) depends on two parameters (5 
and 7, which indicate the strength of the interaction. 

The fact that our model leads to endogenous financial indicators that de- 
scribe the general health of the systems has allowed us to view a credit crisis 
as a microeconomic phenomenon. This has also been exemplified through 
simulation results. 
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The model we have proposed in this paper exhibits some phenomena 
having interesting financial interpretation. There are many extensions that 
could make the model more flexible and realistic, allowing also calibration to 
real data. One of them, concerning the symmetry of the model, has already 
been mentioned in Remark 2.1. Other more substantial extensions are the 
following: 

• In real applications, the variable a denoting the rating class is not binary; 
one could extend the model by taking a to be valued in a finite, totally 
ordered set. 

• One could assume the fundamental values Ui to be R + -valued, and evolv- 
ing according to the stochastic differential equation 



where / and g are given functions, the Bi(-) are independent Brownian 
motions, and Jj(-) is a pure jump process whose intensity is a function of 
LUi(t) and m^r(t). 

• An interesting extension of the above model consists in letting the func- 
tions a(-,-,-) and &(-,-,•) in (1) be random rather than deterministic; 
in particular they may depend on (possibly time-dependent) exogenous 
macroeconomic variables. 

• The mean-field assumption may be weakened by assuming that the rate 
at which Ui changes depends on an i-dependent weighted global health of 
the form 



where J: [0, l] 2 — > R is a function describing the interaction between pairs 
of firms. In other words, the ith firm "feels" the information given by the 
rating of the other firms in a nonuniform way. 

Other generalizations could be useful, in particular to introduce inhomogene- 
ity in the model. In principle, the extensions listed above could be treated 
by the same techniques used in this paper. 



dwi(t) = Ui(t)[f(m%(t)) dt + g(m%(t)) dB^t)} + dJi(t), 




APPENDIX: THE EIGENVALUES OF THE MATRIX A IN 

THEOREM 3.6 



We begin by writing down explicitly the limit matrix A: 



( 



cosh(/3) 



A = 2 




sinh(7mj) + 7mJ cosh(7m£) + 7 



sinh(/3) + m% sinh(7m£) 



sinh(7mj) 



V 



cosh(/J) + cosh(7mj) 
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sinh(/3) \ 

— cosh(7m£) 

-(cosh(/3) +cosh(7?nJ))/ 

where for the first term in the second row we have used (16). By direct 
computation, one shows that the eigenvalues of A are given by the following 
expressions: 

Ai = -2(cosh(/3) + cosh(7?<)), 
A2 = — I cosh(/3) + cosh(jm°) 



(49) + J(cosh(/3) -cosh( 7 m-)) 2 + 47 Smh( ^ 



cosh(7mj) J ' 
A3 = — < cosh(/3) + cosh(7m^) 



(cosh(/3) - cosh(77<)) 2 + 47- S 



cosh (7m J) j 

Note that these eigenvalues are all real, and that clearly Ai, A2 < 0. Moreover, 
A3 < if and only if 

(50) — <cosh 2 (7m^) 

7c 

where 7c=t5JSP)- 

(a) If 7 < 7c, then by part (i) in Theorem 3.4 we have m% = 0. In this 
case (50) holds, because 

— < l = cosh 2 (7-0). 

7c 

In this case the matrix A has three different real eigenvalues, all strictly 
negative. 

(b) If 7 = 7c, we still have m% = 0, but it is immediately seen that A3 = 0. 

(c) Finally, if 7 > 7 C , set y = ^m%\ by (15) we have 

1 7 

(51) ml = — tanh(7m^) 44> y= — tanh(y). 

7c 7c 
Then (50) is equivalent to showing that 

(52) ^<cosh 2 (y) 

7c 



48 



DAI PRA, RUNGGALDIER, SARTORI AND TOLOTTI 



and from (51) we obtain 

— = I — T~TT = ■ r? \ cosh (y) < cosh(y) < cosh 2 (y) 
7 C tanh(y) smh(y) 

because y/ sinh(y) < 1 and cosh(y) < cosh 2 (?/), since y = > if 7 > j c . 
Then, in this case too, the matrix A has three different real eigenvalues, all 
strictly negative. 
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